Towards Reliable Handwritten Character Recognition amid Diacritic Chaos

نویسندگان

  • AbdulRahman O. Ibraheem
  • Odetunji A. Odejobi
چکیده

The orthographies of several languages, such as Yoruba, Arabic, Tshivenda, Ciluba, French, German, and Dutch, use diacritics. These diacritics pose an additional challenge to a character recognition system. Using diacritically-marked uppercase Yoruba letters as a case-study, this paper presents one strategy for addressing this problem. In particular, we present a system for the automatic classification of diacritically-marked handwritten uppercase Yorùbá letters in offline mode. Our approach involves six stages: a pre-processing stage; a segmentation stage for isolating the pertinent Latin letter from the diacritical mark(s); a feature extraction stage where eight geometric properties of the Latin letter are computed; a Bayesian classifier stage where the Latin letter is classified based on the extracted features; a decision tree stage where the diacritical marks are recognized; and a result fusion stage where the results of the two latter stages are combined into a single final class label. A recognition rate of 91.18% was obtained. Also, we introduce three new simple features and a new criterion for comparing the efficacies of features, and show that one of the three features we introduced outperforms four traditional simple moments based on the criterion we introduced.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

متن کامل

Off-line Handwritten Arabic Character Recognition: A Survey

The automatic recognition of text on scanned images has several applications such as automatic postal mail sorting and searching in large volume of documents. Although Arabic handwritten text recognition has been addressed by many researchers, it remains a challenging task due to several factors. This paper presents an overview of off-line handwritten Arabic character recognition and summarizes...

متن کامل

Processing and Recognition of Handwritten Documents

Nowadays, the accurate recognition of machine printed characters is considered largely a solved problem. A lot of commercial products are focused towards that direction, achieving high recognition rates. However, handwritten character recognition is comparatively difficult. So, the recognition of handwritten documents is still a subject of active research. In this thesis we studied the processi...

متن کامل

An Approach towards Recognition of Size and Shape Independent Bangla Handwritten Numerals

This paper presents an approach to recognize off-line Bangla numeral. Today there are many OCR used to recognize Bangla numeral. The recognition of handwritten character is still a challenging work in the field of pattern recognition. Numeral recognition in pattern recognition is the process to identify the given character according to the predefined character set. The difficulties of recogniti...

متن کامل

Classifier Fusion Method to Recognize Handwritten Kannada Numerals

Optical Character Recognition (OCR) is one of the important fields in image processing and pattern recognition domain. Handwritten character recognition has always been a challenging task. Only a little work can be traced towards the recognition of handwritten characters for the south Indian languages. Kannada is one such south Indian language which is also one of the official language of India...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012